biotxcIhdlogy 



RAW SEQUENCE LISTING 
teRROR REPORT 




The Biotechnology Systems Branch of the Scientiflc and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Sbiircei: 



itlMfroix^bySiW:: ill floPI 



> THEATT^^ V* 
' % PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: ' ; 

i); ijJC A COPY OF THIS PRINTOUT IN YOUR NEXT COlvMlNIGAlIONtTO^THE 

: 595 C^15J?a^^ AND FAJQDW^A COPYX)FTHB PRI^^ ^ ^ V. 

■ ■ ' i v;,;-V^'v;^^/^^^^^ ^ . -■■ w.'v. ^ - ■■ ''\'-'>^/: ^- ^c'-'-}--'^' ^ ^^^^^^ 

'^^^^^ - TO INTERPRETATION, PLEASE CONTACT ROBERT WAX/ 703^^^^ 

^^^^^^^ e-mail help: patin21help@uspto,eov or phone 703-306-411i?; (It Wax)^ 

PATENTIN 3.0 e-mail help: patin3help@uspto,eov or phone 703-306-4119 flfc WaxY 

to REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER : ■ 
VERSION 3,0 PROGRAM . ACCESSIBLE THROUGH THE US. PATENT AND^ 7.-; 
TRADEMARK OFFICE WEBSITE. SEE BELOW: ' ^^^^^^^^^^^^^^ ; ^ 



/ Checker Version 3.0 ' 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in ^ 
compliance with format and content rules. Checker Version 3.0 works for sequence hstings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errcred sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the foUowing address: 

http://www.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH -ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

* WraK>cd Nuclcics The numbcrAcxt at the end of each line "wrapped" down to the next line. This may occur if your Hie 

Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent **wrapping.** 
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J 



_lnvalid Line Ungth The rules require that a line not exceed 72 characters in length, this includes white spaces. 



_Misaligned Amino 
Numbering 

Non- ASCII 



_ Variable Length . 



6 PalentIn 2.0 

"bug" 



7 Skipped Sequences 

(OLD RULES) 



8 Skipped Sequences' 

(NEW RULES) 



Use of n's or Xaa's 



II 
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(NEW RULES) 



Jnvalid <213> 
Response 



Uscof<220> 



Patentin 2.0 
"bug" 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission b saved in ASCII text. 

Sequence(s) conUin n's or Xaa*s representing more iha^ one residue. Per Sequence Rules, 

each n orXaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentin version 2.0 has c^ed Afe <220>-<2 23 > section to be missing from amino acid 

sequcnces(s) Normally, Patentin would' automatically. gcricratc this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant '<220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences, 

Sequcnce<s) missing. If intentional, please insert the following lines for each skipped sequence* 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X*' is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION :SEQ ID NO:X: (insert SEQ*ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

Scquence(s) missing. If Intentional, please insert the following lines for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa*s have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>.<223> is MANDATORY if n's or Xaa's arc present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1,823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scicntific'namc (Genus/sp^ies). <220>.<223> section is required when <213> response is Unloiown or 
is Artifictal Sequence 

Sequencc(s) missing the <220> "Feature" and associated numeric identifiers and responses 

Uscof <220> to <223> is MANDATORY if <2I3> "Organism" response is "Artificial Sequence" or 
"Unknown," Please explain source of genetic material in <220> to <223> section. 
(See"FederilRegrster,"06/01/1998, VoI.63.No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentin version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 
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Misuse of n 



^ ri can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to rqjresent 
any value not specifically a nucleotide. 



AMC/MH - Biotechnology Systems Branch - 08/21/2001 



Page 1 of 11 



PCT09 



/ (/h^Z/lo7 Input Set : A:\98015seq.txt 

/ 5kotAte( be S/^'^ Output Set : N:\CRP3\08012001\I673274.raw 



RAW SEQUENCE LISTING DATE: 08/01/2001 

PATENT APPLICATION: US/09/673,274 TIME: 19:08:03 





3 


<110> 




4 


<110> 


w-->. 






w--> 


6 


<110> 




8 


<120> 




10 


<130> 




12 


<140> 


13 


<141> 




15 


<150> 




16 


<151> 




18 


<150> 




19 


<151> 




21 


<160> 




23 


<170> 



APPLICANT: LAMBERTY, MIREILLE 
APPLICANT: BULET, PHILIPPE 
APPLICANT: BROOKHART, GARY 
APPLICANT: HOFFMAN, JULES 

TITLE OF INVENTION: GENE CODING FOR HELIOMICINE, AND USE THEREOF 
FILE REFERENCE: A3 3 59 5 PCT USA 
CURRENT APPLICATION NUMBER: US 09/673,274 
CURRENT FILING DATE: 1999-07-12 
PRIOR APPLICATION NUMBER: PCT/FR99/00843 
PRIOR FILING DATE: 1999-04-12 
PRIOR APPLICATION NUMBER: FR 98 04933 
PRIOR FILING DATE: 1998-04-15 
NUMBER OF SEQ ID NOS : 38 
SOFTWARE: Patentin Ver. 2.1 



Does Not Comply 
Corrected Diskette Needed 



ERRORED SEQUENCES 



LENGTH: 147 OV^vH jlA^^^^ 

TYPE^;^)^/^^ - — ^ / r 7 // ^ 

ORGANtBrrT/fynElietrc construct"^ yyoji Apyj^ (O </t\ tAy^^nJ j^^-u^t^^^t^^^i^ ^/jdjU/y 



25 <210> SEQ ID NO 

26 <211> LENGTHj_147^^^/| ffs^^^ J/i/^f^ 
E--> 27 <212> 

28 <213> 

30 <220> FEATURE 

31 <221> NAME/KEY: CDS 

32 <222> LOCATION: (1)..(147) 

34 <400> SEQUENCE: 1 

35 age ttg gat aaa aga gac aag 
3 6 Ser Leu Asp Lys Arg Asp Lys 
37 1 5 

39 gtc aac tac act agt gac tgc 

40 Val Asn Tyr Thr Ser Asp Cys 

41 20 

43 aag ggt ggc cat tgt gga tec 

44 Lys Gly Gly His Cys Gly Ser 

45 35 

47 acc 147 

48 Thr 

51 <210> SEQ ID NO: 2 

52 <211> LENGTH^169 
E--> 53 <212> TYPE:(^MN 



ttg 


att 


ggc 


age 


tgt 


gtt 


tgg 


ggc 


gee 


48 


Leu 


He 


Gly 
10 


Ser 


Cys 


Val 


Trp 


Gly 
15 


Ala 




aac 


ggc 


gag 


tgc 


aag 


ege 


ege 


ggt 


tac 


96 


Asn Gly 


Glu 


Cys 


Lys 


Arg 


Arg 


Gly 


Tyr 






25 










30 








ttc 


get 


aac 


gtt 


aac 


tgt 


tgg 


tgt 


gaa 


144 


Phe 


Ala 


Asn 


Val 


Asn 


Cys 


Trp 


Cys 


Glu 




40 










45 












54 <213> ORGANISM <^^nthetic construe 

56 <220> FEATURE: ' — ^ " 

57 <221> NAME/KEY: CDS 

58 <222> LOCATION: (1)..(132) 

60 <400> SEQUENCE: 2 

61 gat aag ett ate ggt tec tgc gtg tgg ggt get gtg aac tac act tec 48 

62 Asp Lys Leu He Gly Ser Cys Val Trp Gly Ala Val Asn Tyr Thr Ser 



file://C:\Crf3\Outhold\VsrI673274.htm 
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63 




65 




66 




67 




69 




70 




71 




73 




76 




77 


E--> 


78 




79 




81 




82 




83 




85 




86 




87 




88 




90 




91 




92 




94 




95 




96 




98 




99 




100 




102 




103 




104 




106 




109 




110 


E--> 


111 




112 




114 




115 




116 




118 




119 




120 




121 




123 




124 




125 




127 




128 




129 




RAW SEQUENCE LISTING DATE: 08/01/2001 

PATENT APPLICATION: US/09/673,274 TIME: 19:08:03 

Input Set : A:\98015seq.txt 

Output Set: N:\CRF3\08012001\I673274.raw 

15 10 15 

gat tgc aac ggt gag tgc aag agg agg ggt tac aag ggt ggt cac tgc 96 
Asp Cys Asn Gly Glu Cys Lys Arg Arg Gly Tyr Lys Gly Gly His Cys 

20 25 30 

ggt tec ttc get aac gtg aac tgc tgg tgc gag act tgagagctcg 142 
Gly Ser Phe Ala Asn Val Asn Cys Trp Cys Glu Thr 

35 40 
gcgaggcgaa cgtgtcgacg gatccgg 169 
<210> SEQ ID NO: 3 
<211> LENGTH: 261 
<212> TYPE: 
<213> ORGANI? 
<220> FEATURE: 
<221> NAME/KEY: CDS 
<222> LOCATION: (3)., (224) 
<400> SEQUENCE: 3 

cc atg ggt ttc gtg ctt ttc tct cag ctt cca tct ttc ctt ctt gtg 47 
Met Gly Phe Val Leu Phe Ser Gin Leu Pro Ser Phe Leu Leu Val 
15 10 15 

tct act ctt ctt ctt ttc ctt gtg ate tct cac tct tgc cgt gcc gat 95 
Ser Thr Leu Leu Leu Phe Leu Val lie Ser His Ser Cys Arg Ala Asp 

20 25 30 

aag ctt ate ggt tec tgc gtg tgg ggt get gtg aac tac act tec gat 143 
Lys Leu lie Gly Ser Cys Val Trp Gly Ala Val Asn Tyr Thr Ser Asp 

35 40 45 

tgc aac ggt gag tgc aag agg agg ggt tac aag ggt ggt cac tgc ggt 191 
Cys Asn Gly Glu Cys Lys Arg Arg Gly Tyr Lys Gly Gly His Cys Gly 
50 55 60 

tec ttc get aac gtg aac tgc tgg tgc gag act tgagagctcg gcgaggcgaa 244 
Ser Phe Ala Asn Val Asn Cys Trp Cys Glu Thr 

65 70 
cgtgtcgacg gatccgg 261 
<210> SEQ ID NO: 4 
<211> LENGTH: 120 
<212> TYPE:^DNV^ 

<213> 0RGANl5HT(synthetic construct 
<220> FEATURE 



<221> NAME/KEY 
<222> LOCATION 
<400> SEQUENCE 



CDS 

(12).. (101) 
4 

gcgtegacgc g atg ggt ttc gtg ctt ttc tct cag ctt cca tct ttc ctt 50 
Met Gly Phe Val Leu Phe Ser Gin Leu Pro Ser Phe Leu 
1 5 10 

ctt gtg tct act ctt ctt ctt ttc ctt gtg ate tct cac tct tgc cgt 98 
Leu Val Ser Thr Leu Leu Leu Phe Leu Val lie Ser His Ser Cys Arg 

15 20 25 

get ggagacgega atteacaca 120 
Ala 
30 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/671 ,7^ ^^"^^'^ °8/01/2001 

us./09/673,274 TIME: 19:08:03 

input Set : A,\98015seq. txt 

Output set: N:\CBP3\08012001\l673274.raw 

280 <220> FEATURE: 

281 <221> NAME/KEY: CDS 

282 <222> LOCATION: (7). (204 i 
284 <400> SEQUENCE: ig^"-^^"^^ 

OQc; 4-— J 



-I - - ^ - 2? t^i ti: s s 

i °i - - f J - '^p "I L°i s 

i s z - ?s s - £ - ir 

35 ^ ^^'^ '^iy Glu Cys Lys Arg 

^^^.e |!J - - - -J - . „ J= 

50 ^ ^^"^ Ala Asn Val Asn Cys 



301 tgg tgc gag act tgactcgag 

302 Trp Cys Glu Thr 

303 65 

306 <210> SEQ ID NO: 19 

307 <211> LENGTH: 838 
E--> 308 <212> TYPE:<rS5i^ 



xirniiQ^m ^ — ■ — ~, 

312 <221> NAME/KEY: promoter 

313 <222> LOCATION: (7) r539> 
315 <220> FEATURE- '••(^^2) 

f'l 022> T^rf""^^'- "•"--Structure 

Jl/ <222> LOCATION: ^533^ /s^;ftx 

319 <220> FEATURE. ^"^^ * * ^^^^^ 

320 <221> NAME/KEY: terminator 

323 Inf ^^^^^^ON: (569) (832) 
323 <400> SEQCTENCE: 19 



48 

96 

144 

192 

213 



£^ ac?:^^^^^^^^^ ---- at 

30 g~\- « ca~- t^^-- 

3 2 actgacgaca acaatgaaa^ gaagaagata V^T'^T T^^^"-^^- agacgtaagc 240 
334 acacatgtaa ggtggaaaat gtaaggS^g aSatL^ ttgtgaaaga gacatagagg 300 
336 ccccactact tatcctttta ?att??tccg tatcatt^.^^ tatcacaaag gaatct?a?c 360 
338 aaggaaccaa gttcggcatt tgtgaaaaci aaaaLL.^ f ^^^^^^^t tttcctatat 420 
342 If^^^'l^^ ??atacaact tcagagaaa? ??gtaaa?^^ ^^^^^^^ tattttcttt 480 
342 gcctgaattc gagctcggta ccggatccaa ^^onn^! gtagatctcg attctagaag 540 

346 a.r°\'r' attgaatcct gt?gccgg?c. ^Scgataa? "f^^^^^ tggcaaLal 600 
346 attacgttaa gcatgtaata ai-i-L^=^^ T^tgcgatgat tatcatataa tttctattaa fisn 
348 ttatgattag Igtcccgcaa tSaca^?^ ^^tgcatgac gttatttatg agatJS 720 
350 caaactagga taaatta'tcg cg^gcggS^ "^^fVl aaa at'atagcgcg 78o 

353 <210> SEQ ID NO: 20 ^^^Tgtgt catctatgtt actagatcgg ggatcgat 838 

354 <211> LENGTH: 1036 



://C:\Crf3\Outhold\VsrI673274.htm 



RAW SEQUENCE LISTING DATE- Ofl/m/onni 

PATENT APPLICATION: US/09/673,274 Till. T/.lYo, 

Input Set : Aj\980158eq.txt 
Output Set: N:\CRF3\08012001\I673274.raw 

132 <210> SEQ ID NO: 5 

133 <211> LENGTH: TS 

E--> 134 <212> TYPE:(^n) 

135 <213> ORGANISM :<^7nthetlc construct^ 
137 <400> SEQUENCE: 5 ^ 

III c'StS '"^"""^ cttgtgtot. 60 

143 <210> SEQ ID NO: 6 "^^ 

144 <211> LENGTH: 72 

E--> 145 <212> TYPE<:;^§) -—"^ \ 

146 <213> ORGANISM: d^etic constrict 
148 <400> SEQUENCE: 6 ~ 

"A i:iT.ir,ii ir'"'^' c.c.„.,,, 

154 <210> SEQ ID NO: 7 "72 

155 <211> LENGTH:^ 

E--> 156 <212> TYPE<^>) ^ .-^^.^^^^^ 

157 <213> ORGANISM :(::synthe^ 

159 <400> SEQUENCE: 7" — • 

lit llT^lll^l T,lUTn^ "''"''"^ " 

165 <210> SEQ ID NO: 8 

166 <211> LENGTH:^9 

E--> 167 <212> TYPE<^M5) ' 

168 <213> ORGANISM :Cs?nt'hetic construct^^ 
170 <400> SEQUENCE: 8~ — ^ 

B iiFiFiin- = ~ 

177 <211> LENGTH: 85 
E--> 178 <212> TYPE:<;^N^ 
179 <213> 0RGANISi!11^ 
181 <400> SEQUENCE: g^——— 

III lll^Zl llll^Z'. sr"" " 

187 <210> SEQ ID NO: 10 

188 <211> LENGTH:^^ 
E-"> 189 <212> TYPE:Qd*K ^ 

190 <213> ORGANISM: <s?;^hetlc construct J) 
192 <400> SEQUENCE: l(r^ — • 

1^95 tagagr'" cgcgccttaa ttaaaagctt gcatgcctgc aggtcgactc 60 

198 <210> SEQ ID NO: 11 

199 <211> LENGTH>-^9L5^ 
E--> 200 <212> TYPE:(A^y ^^..^---^ — ^ 

201 <213> ORGANISM: (Jjiithetic construct J 
203 <400> SEQUENCE: lj>^.., ^ 

lit niizi^ — ,c,. <,t.,,t,t,t e« 
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RAW SEQUENCE LISTING DATE: 08/01/2001 

PATENT APPLICATION: US/09/673,274 TIME: 19:08:03 

Input Set : A:\98015seq, txt 

Output Set: N:\CRF3\08012001\I673274.raw 



209 <210> SEQ ID NO: 12 

210 <211> LENGTH: 93 
E--> 211 <212> TYPEj^1s6n 



E--> 



212 <213> ORGANlWi^^ntherflc''^onstruct^ 

214 <400> SEQUENCE: K 

215 ccggcctgaa ccaggtactg aggttgggcc ctcgagcaca cacctaggcg cgccggggcc 60 
217 gcgtttaaac ttaattaagt gtggcctgac tgg 93 

220 <210> SEQ ID NO: 13 

221 <211> LENGTHi.^^ 

222 <212> TYPE<^^^^^ 



223 <213> ORGANISM :(^^nthetic construct^ 
225 <400> SEQUENCE: r3 ' ^ 



^ 

I: (synthetic construct^^ 
1: ^ 




226 ggtctagaat ggcctgcacc aacaacgcca tgagggccct cttcctcctc 50 

229 <210> SEQ ID NO: 14 

230 <211> LENGTH :^^^J5L0 

231 <212> TYPEd^^AD^!) 

232 <213> ORGANISM: ^ 
234 <400> SEQUENCE: 
23 5 ccgaattcgg cgccgtgcac gatgcagaag agcacgagga ggaagagggc 50 

238 <210> SEQ ID NO: 15 

239 <211> LENGTH: ^ 

240 <212> TYPEK^^^X 

241 <213> ORGANISM :/^nt1ietic construct 

243 <220> FEATURE: 

244 <221> NAME/KEY: CDS 

245 <222> LOCATION: (7).. (72) 

247 <400> SEQUENCE: 15 

248 tctaga atg gcc tgc acc aac aac gcc atg agg gcc etc ttc etc etc 48 

249 Met Ala Cys Thr Asn Asn Ala Met Arg Ala Leu Phe Leu Leu 

250 1 5 10 

252 ctg etc ttc tgc ate gtg cae gge gccgaatte 81 

253 Leu Leu Phe Cys lie Val His Gly 

254 15 20 

257 <210> SEQ ID NO: 16 

258 <211> LENGTH: 24 

259 <212> TYPE;/^n) 

260 <213> ORGANlSfflCsytithetie construct^ 

262 <400> SEQUENCE: 

263 gataagctta tcggttcctg cgtg 24 

266 <210> SEQ ID NO: 17 

267 <211> LENGTHi, 32 

268 <212> TYPEjCaDN^ 



269 <213> ORGANISM(^^^synthetdLc''co^ 

271 <400> SEQUENCEr^^TT- ^ 

272 ggetcgagte aagtctcgca ecagcagttc ae 32 

275 <210> SEQ ID NO: 18 

276 <211> LENGTH: jj^ 
E--> 277 <212> TYPE<^^^^ 

278 <213> ORGANISM /Synthetic construct 



^^^synt 
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HAW SEQUENCE LISTING DATE: 08/01/2001 

PATENT APPLICATION: US/09/673,274 TIME: 19:08:03 

Input Set : Aj\98015seq.txt 

Output Set: N:\CRP3\08012001\i673274.raw 

E--> 355 <212> TYPeC^^^ 

356 <213> ORGANISM :C§j^hetlc construct 

358 <220> FEATURE: ~ — ~- 

359 <221> NAME/KEY: promoter 

360 <222> LOCATION: (7).. (532) 

362 <220> FEATURE: 

363 <221> NAME/KEY: CDS 

364 <222> LOCATION: (539).. (736) 

366 <220> FEATURE: 

367 <221> NAME/KEY: terminator 

368 <222> LOCATION: (767) .. (1030) 
370 <400> SEQUENCE: 20 

III aaggtaatta tccaagatgt agcatcaaga atccaatgtt tacgggaaaa 60 

373 actatggaag tattatgtga gctcagcaag aagcagatca atatgcggca catatgcaac 120 
375 ctatgttcaa aaatgaagaa tgtacagata caagatccta tactgccaga atacgaagaa 180 
377 gaatacgtag aaattgaaaa agaagaacca ggcgaagaaa agaatcttga agacgtaagc 240 
379 actgacgaca acaatgaaaa gaagaagata aggtcggtga ttgtgaaaga gacatagagg 300 
381 acacatgtaa ggtggaaaat gtaagggcgg aaagtaacct tatcacaaag gaatcttatc 360 
383 ccccactact tatcctttta tatttttccg tgtcattttt gcccttgagt tttcctatat 420 
385 aaggaaccaa gttcggcatt tgtgaaaaca agaaaaaatt tggtgtaagc tattttcttt 480 
387 gaagtactga ggatacaact tcagagaaat ttgtaagttt gtagatctcg attctaga 538 
III tl gcc etc ttc etc etc gtg etc 586 

390 Met Ala Cys Thr Asn Asn Ala Met Arg Ala Leu Phe Leu Leu Val Leu 

391 1 5 . 2.0 15 

393 ttc tgc ate gtg cac ggc gat aag ctt ate ggt tec tgc gtg tgg ggt 634 

394 Phe Cys lie Val His Gly Asp Lys Leu He Gly Ser Cys Val Trp Gly 

20 25 30 

III t.^ tgc aag agg agg ggt 682 

398 Ala Val Asn Tyr Thr Ser Asp Cys Asn Gly Glu Cys Lys Arg Arg Gly 
35 40 45 

401 tac aag ggt ggt cac tgc ggt tec ttc get aae gtg aac tgc tgg tgc 730 

402 Tyr Lys Gly Gly His Cys Gly Ser Phe Ala Asn Val Asn Cys Trp Cys 

403 50 55 gQ 

405 gag act tgactcgagg gggggecegg taccggatec aatteecgat cgttcaaaca 786 

406 Glu Thr 

407 65 

409 tttggcaata aagtttctta agattgaate ctgttgecgg tettgegatg attatcatat 846 
411 aatttctgtt gaattaegtt aagcatgtaa taattaaeat gtaatgcatg acgttattta 906 
413 tgagatgggt ttttatgatt agagtccege aattatacat ttaatacgcg atagaaaaea 966 
415 aaatatagcg cgcaaactag gataaattat cgegegeggt gtcatctatg ttaetagatc 1026 
417 ggggatcgat ini^ 

420 <210> SEQ ID NO: 21 ^ 

421 <211> LENGTH: 52-^ 

E--> 422 <212> TYPEtCi^N ---s^ 

423 <213> ORGANISM <iynthetic eonstrueO 
425 <400> SEQUENCE: 21 " 

I29 <L1>'Ieq%d"o?'22" '"^^"^^'^ agctgtgttt ggggcgecgt ca 
430 <211> LENGTH: 56 
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RAW SEQUENCE LISTING DATE: 08/01/2001 

PATENT APPLICATION: US/09/673,274 TIME: 19:08:03 

Input Set : A:\98015seq, txt 

Output Set: N:\CRP3\08012001\I673274.raw 




E--> 431 <212> TY^Ei^^^X^ 

432 <213> ORGANISM :Ciwthetic construct 

434 <400> SEQUENCE: 22""- ■ ^ 

435 agtgtagttg acggcgcccc aaacacagct gccaatcaac ttgtctcttt tatcca 56 

438 <210> SEQ ID NO: 23 

439 <211> LENGTinS2 
E--> 440 <212> TYPE:<^^^^^ — ^ 

441 <213> ORGANISM :Cs^thetic construct 

443 <400> SEQUENCE: 23^ — 

444 actacactag tgactgcaac ggcgagtgca agcgccgcgg ttacaagggt gg 52 

447 <210> SEQ ID NO: 24 

448 <211> LENGTH^J2 

E--> 449 <212> TYPE:Cm^) ^ — 

450 <213> ORGANISM :(syntfietic construct 

452 <400> SEQUENCE: 24 — 

453 cacaatggcc acccttgtaa ccgcggcgct tgcactcgcc gttgcagtca ct 52 

456 <210> SEQ ID NO: 25 

457 <211> LENGTH: 5£ 

E-"> 458 <212> TYPE:CA^y ^ 

459 <213> ORGANISM :/syntheticconstruct^ 

461 <400> SEQUENCE: 25- --"'^^^ 

462 ccattgtgga tccttcgcta acgttaactg ttggtgtgaa acctgatagg tcgaca 56 

465 <210> SEQ ID NO: 26 

466 <211> LENGTH£^^^,.&3^ 
E--> 467 <212> TYPEiCMN. 




468 <213> ORGANISM :(:;^nthetic construct^) 
470 <400> SEQUENCE : 2?"^-— ~ — — 



471 gatctgtcga cctatcaggt ttcacaccaa cagttaacgt tagcgaagga tc 52 

474 <210> SEQ ID NO: 27 

475 <211> LENGTH2A2 

E--> 476 <212> TYPEs(^^ 

477 <213> ORGANISM: (synthetlccon struct J) 

479 <400> SEQUENCE: 27-- — -^"^ 

480 gatccttcgc taacgttaac tgttggtgta gaacctgata gg 42 

483 <210> SEQ ID NO: 28 

484 <211> LENGTH: 42 
E--> 485 <212> TYPE:(^)N^ 

486 <213> ORGANIJ 

488 <400> sequence's 

489 tcgacctatc aggttctaca ccaacagtta acgttagcga ag 42 

492 <210> SEQ ID NO: 29 

493 <211> LENGTHji; 
E--> 494 <212> TYPE/^^^ 

495 <213> ORGAnJbH^ synthetic construct 

497 <400> SEQUENCE: 

498 ctagtgactg caacggcgag tgcttgttgc gc 32 

501 <210> SEQ ID NO: 30 

502 <211> LENGTH^^Zfi 
E--> 503 <212> TYP^f^N^ 



TH: 42 

•C^5^— > 

NlSRr^synthetlc * construct^ 
ENCEr-iS . — 
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RAW SEQUENCE LISTING DATE: 08/01/2001 

PATENT APPLICATION: US/09/673,274 TIME: 19:08:03 

Input Set : Aj\98015seq.txt 

Output Set: N:\CRF3\08012001\I673274.raw 

504 <213> ORGANISM :(^nthei:i:c" con struct^ 

506 <400> SEQUENCE :3-&— 

507 gcaacaagca ctcgccgttg cagtca 26 

510 <210> SEQ ID NO: 31 

511 <211> LENGTH: 32 
■> 512 <212> TYPE:CS)N^ 



513 <213> ORGANISM: (synth^^iT^on struct^ 
515 <400> SEQUENCE: 31 ^ 



516 ctagtgactg cgctgctgag tgcaagcggc gc 32 

519 <210> SEQ ID NO: 32 

520 <211> LENGTH: 26 

■> 521 <212> TYPE<;;;Sn^ 

522 <213> ORGANISM :Cs^i€hetic construct J" 

524 <400> SEQUENCE: 32 — — — 

525 gccgcttgca ctcagcagcg cagtca 26 

528 <210> SEQ ID NO: 33 

529 <211> LENGTH :^0 
■> 530 <212> TYPE:C^]^) ^^^..^ 

531 <213> ORGANISM: Csynthetic construct 
533 <400> SEQUENCE: 3T 



534 agcttggata aaagagctgc tgctgctggt agctgtgttt 40 

537 <210> SEQ ID NO: 34 

538 <211> LENGTHj^J^ 

539 <212> TYPE:^nJ 



540 <213> ORGANISM: (synthe€icconstruct 
542 <400> SEQUENCE: J5 ^ " 



543 ggggcgccgt caactaca 28 

546 <210> SEQ ID NO: 35 

547 <211> LENGTH: 22 

E--> 548 <212> TYPE:(^to)^ ^^-^^ 

549 <213> ORGANISM: ^ynthetic construct) 

551 <400> SEQUENCE: J5 

552 ctagtgtagt tgacggcgcc cc 22 

555 <210> SEQ ID NO: 36 

556 <211> LENGTHj^^.3^ 
E--> 557 <212> TYPE:(^^^V_ 

558 <213> ORGANISM <^syntheticcon 

3: 3^ — "--^ 



560 <400> SEQUENCE: 

561 aaacacagct accagcagca gcagctcttt tatcca 36 

564 <210> SEQ ID NO: 37 

565 <211> LENGTH: 32 

566 <212> TYPEtTAM 



567 <213> ORGANlSH^Y^thetic^'con^^ 

569 <400> SEQUENCE : 37^-^ ""^"^ 

570 ctagtgactg cgctgctgag tgcttgttgc gc 32 

573 <210> SEQ ID NO: 38 

574 <211> LENGTH: ^ 

575 <212> TYPE<^^ " 

576 <213> ORGANISM(r^^^syn^ construct 
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RAW SEQUENCE LISTING DATE: 08/01/2001 

PATENT APPLICATION: US/09/673,274 TIME: 19:08:03 

Input Set : Ai\98015seq.txt 

Output Set: N:\CRF3\08012001\I673274.raw 

578 <400> SEQUENCE: 38 

579 gcaacaagca ctcagcagcg cagtca 
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VERIFICATION SUMMARY 
PATENT APPLICATION: 



US/09/673,274 



DATE: 08/01/2001 
TIME: 19:08:04 



Input Set : A:\98015seq.txt 

Output Set: N:\CRF3\08012001\I673274 .raw 



L:4 M:280 W: Numeric Identifier already exists, <110> found multiple times 

L:5 M:280 W: Numeric Identifier already exists, <110> found multiple times 

L:6 M:280 W: Numeric Identifier already exists, <110> found multiple times 

L:13 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:27 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:53 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:78 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

ri:lll M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:134 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:145 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:156 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:167 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:178 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:189 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:200 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:211 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:222 M:310 E: (3) Wrong or Missing Sequence Type, TYPE; 

L:231 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:240 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:259 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:268 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:277 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:308 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:355 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:422 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:431 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:440 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:449 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:458 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:467 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:476 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:485 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:494 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:503 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:512 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:521 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:530 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:539 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:548 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:557 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:566 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:575 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 
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